Nucleosome shape dictates chromatin-fiber structure 



Martin Depken/'^'^* Helmut Schiessel^ 
^ Max Planck Institute for the Physics Complex Systems, 
Nothnitzer StraBe 38, 01187 Dresden, Germany 
^Max-Planck Institute for Molecular Cell Biology and Genetics, 

Pfotenhauerstr. 108, 01307 Dresden, Germany 
^Instituut-Lorentz for Theoretical Physics, Universiteit Leiden, 
Postbus 9506, 2300 RA Leiden, The Netherlands 
*To whom correspondence should be addressed; E-mail: depken@pks.mpg.de. 



Apart from being the gateway for all access to the eukaryotic genome, chro- 
matin has in recent years been identified as carrying an epigenetic code regu- 
lating transcriptional activity. The detailed knowledge of this code contrasts 
the ignorance of the fiber structure which it regulates, and none of the sug- 
gested fiber models are capable of predicting the most basic quantities of the 
fiber (diameter, nucleosome line density, etc.). We address this three-decade- 
old problem by constructing a simple geometrical model based on the nucleo- 
some shape alone. Without fit parameters we predict the observed properties 
of the condensed chromatin fiber (e.g. its 30 nm diameter), the structure, and 
how the fiber changes with varying nucleosome repeat length. Our approach 
further puts the plethora of previously suggested models within a coherent 
framework, and opens the door to detailed studies of the interplay between 
chromatin structure and function. 

Eukaryotic DNA is wrapped around histone proteins, resulting in a string of wedge-shaped 
nucleosomes connected via short stretches of linker DNA. Under physiological salt concen- 
trations this string can undergo additional folding, forming what is referred to as the 30 nm 
fiber. Whereas the structure of the nucleosome is known to atomic resolution (1), the three- 
dimensional arrangement of nucleosomes in the 30 nm fiber remains poorly understood — 
despite three decades of experiments and model building (|2]). The wide range of models docu- 
mented in the literature can be divided into roughly two classes: the traditional solenoid mod- 
els Q and the crossed linker models (4). Unfortunately, neither type provide criteria to identify 
the optimal fiber geometry, and basic quantities like the fiber diameter are ultimately fixed by 
the fine tuning of unknown parameters. This lack of predictive power and the fact that many 
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experiments (|^|^[^|^|9]) were likely performed on amorphous samples are arguably the 
main reasons why the structure has remained unresolved for so long. Some of the experimental 
issues have recently been overcome through the reconstitution of highly regular fibers ( jii||i2| ), 
but a comprehensive modeling effort determining the structure is still lacking. We take in- 
spiration from previous success of geometric arguments when applied to information-carrying 



structures (13) and address this problem by considering all fibers satisfying the geometric con- 
dition that the nucleosome core particles (NCPs) pack densely on the periphery of the fiber Q 
(see Fig. 1 a). Without free parameters this enables us to make definite predictions that are all 



born out when compared to the new regular reconstituted fibers (11 12). Our approach makes 
the implicit assumption the short range attraction between NCPs (14 T5\\T6^ constitute the dom- 
inant mode of interaction in dense chromatin fibers, being only weakly modulated by the soft 
contribution from the DNA-linker backbone. Taking this view, the problem of determining the 
structure of the chromatin fiber splits into two parts: the identification of dense configurations 
of NCPs on the periphery of the fiber, and the estimation of the energetic contribution from the 
linker backbone in order to determine which structure is realized. 

First addressing the formation of a dense shell we note that NCPs aggregate into arcs in 



solution (17), indicating that their wedge-shaped form Q can play a key role in dictating large 
scale arrangements of interacting nucleosomes. Drawing on this, we take the ejfective shape 
of the NCP as being that of a wedge shaped cylinder (see Fig. 1 b). In Figure 1 c we show 
how a dense packing of nucleosome footprints on a periodic strip is a necessary condition for a 
dense three-dimensional packing of nucleosomes in the fiber. By assuming a dense packing the 
footprints are forced into helical ribbons winding along the fiber. These are formed along either 



of the footprints symmetry axes. Models normally referred to as interdigitated ( [77] belong 
to the set of dense packings where the ribbons form along the major axis (NCPs stacking side 
to side). Here we assume the NCPs to stack face to face, corresponding to footprints forming 
ribbons along their minor axis (see Supplementary note for the modifications to the below in 
the case of interdigitated structures). Such stackings of NCPs aggregate spontaneously under 
the right solvent conditions ( [7^[79l ), and is also what best utilizes any short range attractive 
interaction. For a dense footprint packing, the nucleosome line density (NLD) a is simply the 
width of the strip onto which they pack, divided by the foot print area, 

n(D - a) 

. ■ (1) 

ab 

The manner in which the ribbons spiral up along the fiber, parameterized by the ribbon angle 7, 
is set by the requirement that the A^rib ribbons precisely fill up the periodic strip (see Fig. 1 c), 

iVrib— = 7r(D-a). (2) 
cos 7 

In addition we require that the backbone connects all the nucleosomes in a regular fashion. 
Denote by A^stcp the distance across ribbons between connected nucleosomes (see Fig. 1 a). The 
necessary and sufficient condition for a regular backbone winding (BW) — completely defined 
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by the pair (A'rib, A'step) — is the existence of two integers n and k with < n < A; < A^rib such 
that 

fcAT^tep - nA^rib = 1. (3) 

Equation 3 ensures that neighboring ribbons are eventually connected (after k steps and n turns, 
see Supplementary note) and hence all ribbons are visited by the backbone (c.f. (4, 2), which 
only connects half of the NCPs). The trivial BW (A^rib, 1) corresponds to the backbone con- 
necting nucleosomes in neighboring ribbons (since A^'step = 1, see Fig. 1 a). Such a backbone 
can be found for fibers with any number of ribbons since with n = and k = \ condition Equa- 
tion 3 is always satisfied. The classical solenoid model (jJj) has a (1, 1) BW, and all the models 
considered by Wong et al. (20) have trivial BWs. By scanning through the finite number of 
possible n's and k's one finds all additional non-trivial BWs, extending the set of crossed-linker 
models to (5,2), (7,2), (7,3), (8,3), and so on. Thus this approach exhaustively covers all 
major contending models for the fiber structure (|5||7 21 ^0|2]) (solenoid models, crossed linker 



models, interdigitated models, etc.), including some specific models not previously considered, 
and puts them firmly within a coherent framework. 

Returning to the full three-dimensional packing of nucleosomes we note that all admissible 
footprint packings correspond to NCPs packed together with different effective wedge angles 
(see Fig. 1 b). Through considering the curvature along the ribbons it is straightforward to relate 
the effective wedge angle to the fiber diameter (see Supplementary note). Within the relevant 
parameter ranges the exact expression can be approximated as 
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with an accuracy of a couple of tenths of a degree. For any specific fiber diameter D and number 
of ribbons A^rib this directly gives the effective wedge angle, and it can easily be inverted to give 
the possible fiber diameters for any specific effective wedge angle. In Figure 2 a we show 
how the effective wedge angle varies with fiber diameter for fibers with up to ten ribbons. In 
what follows we will use the effective NCP diameter a = 11.5 nm and average height b = 6.0 



nm as deduced for the close packings of NCPs into columnar quasi-hexagonal crystals (19) 
under physiological salt concentrations and moderate pressures. We are ultimately interested 
in the in vivo situation where there are additional linker histones present, bringing the in and 



out going DNA at each nucleosome into a stem structure (22) (see Fig. 1 b). Taking this into 
account, we require D > 2(Zstem + o.) in order to avoid steric interactions between stems on 
opposite sides of the fiber. Here Zstem = 3 nm is the length of the induced stem as measured 
by Bednar et al. (22). In Figure 2 b we illustrate all fibers and BWs with a diameter of 33 



nm. They include the solenoid model (1, 1) (^), the two-start helix (2, 1) (72) and the crossed 
linker model (5, 2) (4). It is clear from the number of possible structures that fixing the fiber 
diameter tells us little about the internal structure of the fiber, though it explains the wide range 
of models suggested in the literature; all made consistent with the experimental findings but with 
little predictive power. Instead we take a reductionist approach and enforce the microscopic 
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condition of optimal dense face-to-face stacking of nucleosomes. In experiments by Dubochet 
and Noll ( [77| ) unconstrained nucleosomal arcs were observed with the effective wedge angle 
a = 8° for the NCP repeat unit. This will be the effective wedge angle assumed throughout 
the rest of this paper. With this microscopic condition we directly get a discrete set of possible 
shell structures, three of which are shown in Figure 2 c (structures A, B, and C), and all of 
which are clearly distinguished from each other on the level of the fiber diameter and NLD (see 
Table 1). Here we do not discuss the very wide fibers, one of which is displayed in the inset of 
Figure 2 a. These structures might never be realized in chromatin, but are similar to the gigantic 
tubes of NCPs observed by Dubochet and Noll (17). The results of the simple assumption of a 
dense packing of nucleosomal wedges are summarized in Table 1, where we list all fibers with a 
fiber diameter up to 63 nm. As detailed below some of these have already been observed, while 
others might still be found through further experiments. 

Armed with a small set of possible shell structures, we now examine the linker backbone 
to determine which of these is realized for any specific nucleosomal repeat length. Though 
we lack a precise model for the energetics of the backbone, we can still put upper and lower 
bounds on the possible linker lengths for a specific shell structure and BW. The lower bound 
is set by the shortest distance between two successive stems along the backbone. This depends 
not only on the BW {N^i)^, Nstcp) but also on the relative helicity of backbone and ribbons. 
We denote structures where ribbons and backbone have the same helicity by (A^rib, ^stcp)^^ 
and by (Arib? ^stcp) in the opposite case. The upper limit for the linker length is set by the 
excluded volume constraint on the inside of the fiber. We assume that due to the presence of 
cationic histone tails the highly charged linker DNA can be hexagonally packed with a shortest 
centre-to-centre distance set by the DNA diameter c^dna = 2 nm. The resulting limits on the 
linker lengths are indicated in Table 1. We thus see that for the shortest repeat lengths the 
realized structure must always be (5, 1)^ or (5,2)^. These feature a 33 nm diameter, from 
which the 30 nm fiber derives its name. Of these two structures we expect (5,2)^ to be realized 
since it allows for the straightest linkers (see Fig. 2 c, structure A). When increasing the linker 
length the fiber must take on another structure before the maximum repeat length of 210 bp. 
The fact that (7, 3)''' (possible for repeat lengths over 207 bp) has the straightest linkers (see 
Fig. 2 c, structure C) makes it a good candidate for the target structure. Thus, through the very 
simple, geometric, and microscopic condition of an optimal nucleosome packing combined with 
rudimentary arguments concerning the backbone, we are able to make predictions concerning 
the precise structure realized for different nucleosomal repeat lengths. 

Having discussed the theoretically possible fibers for different repeat lengths, and identified 
a plausible transition point between structures, we now compare this with recent results on dense 
reconstituted fibers. Robinson et al. ([77|) observed that such fibers clustered into two sets, each 
signified by a specific fiber diameter and NLD (see Fig. 3). As pointed out by Wong et al. (20), 
a similar clustering is also seen for the native fibers examined by Williams et al. (j6|. Since the 
fiber diameter and the NLD are linearly related through condition Equation 1, we can use this 
as a direct test of our approach. In Figure 3 we plot this relation together with the observed fiber 
diameter and nucleosome line densities ([77]). Our model is consistent with the experimental data 
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and manages to account for both thin and thick fibers without any fit parameters. In Figure 2 we 
have indicated the average diameters of the two clusters observed by Robinson et al. pT[ ). The 
structures predicted by our model (Fig. 2 a and c, structures A and C) are the only fibers within 
the error bars of the experiments. It can also be seen that these predictions are rather robust 
against changes in the effective wedge angle. In addition, the transition between fiber structures 
is observed somewhere between repeat lengths of 207 and 217 bp, which is also captured by 
our model. Apparently contradicting these results is another recent set of experiments ( [72] ) 
suggesting a two-ribbon structure. These were performed on short fibers (10-12 nucleosomes), 
and would thus be unlikely to capture the structures suggested here (5 and 7 ribbons) since 
they only allow for around two nucleosomes to stack in each ribbon. In line with the basic 
assumption of our model, that nucleosome interactions drive the assembly of the fiber, we expect 
short fibers to favor fewer ribbons in order to minimize the number of nucleosome faces exposed 
to the solution. The same reasoning applies to inferring the arrangement of nucleosomes in the 
fiber from the crystallographic structure of the tetra-nucleosome {23) . Thus we conclude that 
our zero-fit-parameter model is consistent with all experimental findings on regular fibers to 
date. 

Our model could be further tested through e.g. linear dichroism studies determining 
the ribbon angle 7 for these newly available structures. It would also be interesting to see if 
structures like (8,3)^ or (9,4)='' are ever realized for repeat lengths longer than those inves- 
tigated by Robinson et al. (11}. Moving away from the 10 bp ladder (set by DNA's helical 
repeat length) used by Robinson et al. would further elucidate whether it is the bend or twist 
energy of the linker DNA that dictates the chosen structures. Our model also suggests that the 
predicted structures are insensitive to a certain amount of variations in the nucleosome repeat 
length. With the inclusion of a energetic model for the linker backbone the above development 
should form the basis for statistical and kinetic studies of how in vivo variations and correla- 
tion in repeat length (24 ) affect the locally realized structure, its stability, and thus the observed 
condensation-decondensation transition of the 30 nm fiber. In vitro, this transition can be probed 
by a change of ionic conditions (22 ), or the application of a sufficiently large external force, e.g. 
in a single-molecule-experiment using optical tweezers (25). In vivo, this can be done by the 
acetylation of histone tails { 26\\27^ , thus offering a straightforward way of increasing the acces- 
sibility to the packed genetic material. Understanding the structure of the fiber now opens the 
door to detailed study of this transition, and its connection to the histone code (28 29 27). Ulti- 
mately this structural knowledge should be combined with biochemical studies in order to move 
towards a comprehensive understanding of the subtle interplay between structure and function 
in chromatin dJOh. 
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Table 1: The predicted fiber structures. The table displays all the calculated properties of the 
fibers consistent with the structure of the nucleosome. They are (left to right): fiber structure, 
fiber diameter, nucleosome line density, angle formed between ribbons and fiber axis, minimum 
nucleosomal repeat length, and maximum nucleosomal repeat length. The minimum repeat 
length differs for different relative helicities between linker backbone and ribbons, the value 
for opposite relative helicity being indicated within parenthesis (for the case of at least one bp 
difference between the two). Empty fields take the values of fields directly above. 
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(9,1)+ 
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200(198) 
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Figure 1: The model of the dense chromatin fiber, a) In a dense cliromatin fiber the nucle- 
osomes pack on the outside of the fiber, and the linkers are situated on the inside. Here the 
{Nrih = 9, Nstep = 4) backbone is illustrated, completely specifying the way the nucleosomes 
are connected, b) Illustration of how three nucleosomes stack, and how this relates to the ef- 
fective wedge shape of the nucleosome (outlined). By using the term effective wedge shape 
we stress that the nucleosome itself need not form a perfect wedge, but rather that we rely on 
the experimental observation (I77| that when they aggregate one can identify a wedge shaped 
repeat unit. Onto the wedge shaped cylinder we attach a rigid stem (22) to represent the in and 
out going DNA and a linker histone (not shown), c) Illustration of how a dense nucleosome- 
footprint packing on a cylinder running through the nucleosome centers is a necessary condition 
for a dense three-dimensional structure. Also indicated are the ribbons induced by the dense 
packing, together with the angle 7 they make with the fiber axis. 
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Figure 2: The predicted fibers, a) Plot of how the effective wedge angle a varies with fiber 
diameter D (or, equivalently nucleosome line density a) for fibers with up to ten ribbons. Each 
curve is labeled as: number of ribbons - number of possible backbones. Indicated is also the 
effective wedge angle measured by Dubochet and Noll ( pT] ) and used in this paper to predict 
the fiber structures. A giant fiber solution with 87 nm diameter, and similar to the ones seen 
by Dubochet and Noll {17), is displayed in the inset. Further indicated are the average fiber 
diameters for the two different sets of fibers observed by Robinson et al. (11) (see also Fig. 3), 
with the area including one standard deviation indicated in grey, b) All solutions with different 
effective wedge angles achieved by fixing the fiber diameter to 33 nm (point 1-5 in Panel a), 
together with the available backbones, c) Some of the structures predicted by fixing the effective 
wedge angle to the value measured by Dubochet and Noll ( ^17) (points A-C in Panel a), together 
with the allowed backbones. The fibers with 5 (A) and 7 (C) ribbons come very close to the 
structures observed by Robinson et al. ([77]) for dense reconstituted fibers. 
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fiber diameter, Dfn, [nm] 

Figure 3: Experimental observations on dense fibers. Re-representation of data published by 
Robinson et al. (11) for reconstituted fibers with different nucleosomal repeat lengths (indi- 
cated). The data points are seen to cluster around two specific diameters, D =33 nm and 44 
nm, and nucleosome line densities, a =1.0 nm^ and 1.4 nm^. The error bars indicate one 
standard deviation. Also shown is the linear relation between fiber diameter and nucleosome 
line density predicted by our model Eq. 1. It is seen to be consistent with both thin and thick 
fibers, without any adjustable parameters. 
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Supporting Notes 




Figure SI: Footprint packing structure. The footprint packing defined in Figure 1 c of the main 
text, with lattice vectors p^^^ and p^^^^ indicated, together with the unit vectors 6^,6^., and e^ib- 

The relation defining the backbone winding: Building on the footprint packings displayed 
in Figure 1 of the main text, define the vector p^^^ as the vector connecting the nucleosome at 
its base (see Fig. SI) with the next nucleosome placed in the same ribbon which is encountered 
when moving along the backbone. In a similar manner define p^.^^ as ending at the next nu- 
cleosome encountered in the ribbon neighboring to the right. Let fctop be the number of steps 
taken along the backbone when going between the nucleosomes connected by p^^^, and ntop 
the number of times the fiber was circled in doing so. Define ksidc and Usidc in the analogous 
manner. Then we have 

hop = (^topA - n{D - a)ntop)eg + 

hide = (^topA - n{D - a)nside)ee + ^^e^, 

where a is the nucleosome line density along the fiber, and A is the circumferential distance 
between nucleosomes following each other along the backbone. For the packing of ribbons 
to be dense, the parallelogram spanned by these two vectors must be of the same area as the 
footprint, 

= E^ide ^ Etop = ~ a)(^topfcside - "'side/i;top)/0'- 

Using Equation 1 of the main text this becomes 

'^top^side '^side^top !• 

Before returning to the same ribbon, all ribbons traversed on the outside of the fiber in the first 
step has to be visited exactly once (ribbon 1,2, ... , Ngtcp — 1 of Fig. 1 a of the main text). This 
can only be done during successive turns around the fiber, and thus Ai'step = '^top- Also, before 
returning to the same ribbon, all other ribbons must have been visited exactly once, giving 
^top = Nj-ik, where A^^b is the number of ribbons in the fiber. Thus Equation 3 of the main text 
follows. 



12 



Approximate relationship for wedge angle: The curvature tensor for the cylinder surface 
defined in Figure 1 a of the main text is given in the orthonormal basis {eg, e^) (see Fig. SI) as 

and the unit vector aligned with the ribbons is given by 

e^ib = sin 7 eg + cos 7 e^. 
With this we can calculate the radius of curvature along the ribbons, i?rib, as 

1 2 sin' 7 

-L/ -Krib — Crib ■ ■ Crib — -j^ ■ 

JJ — a 

The wedge angle is now approximated by 

, 26 sin' 7 

a b/Rrib = — , 

D — a 

which by use of Equation 2 in the main text directly gives Equation 4 of the main text. 

Modifications for interdigitated models: For the interdigitated models Equation 1 and Equa- 
tion 3 stay the same, while Equation 2, and Equation 4 become 

A^rib— = AD - a) 
cos 7 



and 



respectively. 



rib 



7r{D - a) 
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